Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement
نویسندگان
چکیده
In the modulation-filtering based speech enhancement method, noise suppression is achieved by bandpass filtering the temporal trajectories of the power spectrum. In the literature, some authors use the power spectrum directly for modulation filtering, while others use different compression functions for reducing the dynamic range of the power spectrum prior to its modulation filtering. This paper compares systematically different dynamic range compression functions applied to the power spectrum for speech enhancement. Subjective listening tests and objective measures are used to evaluate the quality as well as the intelligibility of the enhanced speech. The quality is measured objectively in terms of the Perceptual Estimation of Speech Quality (PESQ) measure and the intelligibility in terms of the Speech Transmission Index (STI) measure. It is found that P 0.3333 (power spectrum raised to power 1/3) results in the highest speech quality and intelligibility.
منابع مشابه
Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملSpeech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering
This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...
متن کاملEnhancement of Noise Performance in Digital Receivers by Over Sampling the Received Signal
In wireless channel the noise has a zero mean. This channel property can be used in the enhancement of the noise performance in the digital receivers by oversampling the received signal and calculating the decision variable based on the time average of more than one sample of the received signal. The averaging process will reduce the effect of the noise in the decision variable that will approa...
متن کاملDesign of a High Range, High Efficiency Spread Spectrum Transmitter for Audio Communication Applications
This work proposes a direct sequence spread spectrum transmitter with high transmission range and efficiency for audio signals. It is shown that by choosing high process gain for spread spectrum signal the data could reach a range of 55km in the 2.4GHz ISM band. By employing a light modulation scheme, we have a relaxed SNR requirement for having a low bit error rate (BER) which translates to re...
متن کاملOptimal Design of UPFC Output Feed Back Controller for Power System Stability Enhancement by Hybrid PSO and GSA
In this paper, the optimal design of supplementary controller parameters of a unified powerflow controller(UPFC) for damping low-frequency oscillations in a weakly connected systemis investigated. The individual design of the UPFC controller, using hybrid particle swarmoptimization and gravitational search algorithm (PSOGSA)technique under 3 loadingoperating conditions, is discussed. The effect...
متن کامل